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Abstract 

For a complete graph of size n, assign each edge an i.i.d. exponential variable with mean n. 
For A > 0, consider the length of the longest path whose average weight is at most A. It was 
shown by Aldous (1998) that the length is of order logn for A < 1/e and of order n for A > 1/e. 
Aldous (2003) posed the question on detailed behavior at and near criticality 1/e. In particular, 
Aldous asked whether there exist scaling exponents [i, v such that for A within 1/e of order n~ M , 
the length for the longest path of average weight at most A has order n u . 

We answer this question by showing that the critical behavior is far richer: For A around 
1/e within a window of a(logn) -2 with a small absolute constant a > 0, the longest path is 
of order (logn) 3 . Furthermore, for A ^ l/e + /3(logn) -2 with j3 a large absolute constant, the 
longest path is at least of length a polynomial in n. An interesting consequence of our result 
is the existence of a second transition point in 1/e + [a(log7i) _2 ,/3(logn) _2 ]. In addition, we 
demonstrate a smooth transition from subcritical to critical regime. Our results were not known 
before even in a heuristic sense. 

1 Introduction 

In this work, we study the stochastic mean-field distance model. For a complete graph G = (V, E) 
of size n, associate each edge e G E a non-negative weight X e which is an independent exponential 
variable with mean n. For A > 0, let L(n, A) be the length of the longest path whose average weight 
is at most A. It was shown by Aldous [2] that with high probability (with probability tending to 
1 as n — > oo) L(n, A) = O(logn) for A < 1/e and L(n,A) = 8(n) for A > 1/e. Aldous [3] posed 
the question on the detailed behavior of L(n,X) at and near criticality 1/e. In particular, Aldous 
asked whether there exist scaling exponents fi, v such that 

nT^Lin, e -1 + xrT u ) — > mix) (1) 

in probability for some deterministic function m{x) satisfying 

lim mix) = oo, lim mix) = 0. 

X— >oo x— > — oo 

We show in this work that the critical behavior for the stochastic mean-field model is different 
from and far richer than that questioned as in ([T]). Our first result determines the order of L(n, A) 
at criticality as well as establishes the right order for the critical window, as incorporated below. 

Theorem 1.1. There exist absolute constants a,C,c > such that for all e _1 — (logn) -2 A ^ 
e -1 + a(log n)~ 2 

P (c(log n) 3 < L(n, A) ^ C(log n) 3 ) -> 1 , 
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Remark. In a recent private communication, Aldous made a guess that L(n,l/e) = n°^ l \ which 
is confirmed by the preceding theorem. 

Our second result shows a lower bound of polynomial in n on L(n,A) if (A — l/e)/(logn) 2 
exceeds a large absolute constant. 

Theorem 1.2. There exist absolute constants f3, C > such that for all A ^ e _1 + /3(logn)~ 2 

P (ra 1/4 ^ L(n, A) Cn(X - e" 1 )) -> 1 , 

Remark. It seems more careful analysis can improve the lower bound to n 1 / 2+ °^ 1 \ We chose not 
to do so because we believe it is still far away from being tight and thus the improvement is only 
technical. 

Interestingly, Theorems 11.11 and 11.21 imply that there is yet another phase transition occurring 
somewhere in l/e + [a (log n) -2 , /3(log n)~ 2 ]. In addition, we demonstrate a smooth transition from 
subcritical to critical regime. 

Theorem 1.3. There exist absolute constants C, c > such that for all A e _1 — (logn) -2 , 
P ^(e" 1 - A)" 1 log n < L(n, A) < C^ 1 - A) -1 logra) -> 1 . 

Related work. While our work focuses on the second order behavior (or finite-size scaling in the 
language of statistical physics), the first-order behavior was studied by Aldous [4]. It is believed 
that L(n,X)/n — > 5(A) in probability as n — > oo for some function 5(A). Indeed, we see that 
5(A) = for A < 1/e. In [4J, a non-rigorous derivation of 5(A) using a reformulation of the cavity 
method gives that 5(A) x (A — 1/e) 3 . 

In addition, the quantity L(n, A) studied in this paper is a natural variant of several other 
objects that were studied before. If we consider the path of small maximal weight other than 
average weight, this is an extensively studied question of the longest path in Erdos-Renyi random 
graphs. For G ~ G(n, c/n) (a random graph obtained by preserving each edge in complete graph 
with probability c/n independently), Ajtai, Komlos, and Szemeredi pQ proved that there is a path 
of length a(c)n where a(c) > for c > 1 and a(c) —> 1 as c — > oo; a similar and slightly weaker 
result was shown independently by Fernandez de la Vega [7J. Later, the attention was shifted to 
the asymptotics of 1 — ct(c). Improving a previous work of Bollobas [5], Frieze [8] obtained a sharp 
estimate on the asymptotics of 1 — a(c) as c — > oo. In addition, it is not hard to see that for c < 1 
the longest path is of order log n, and for c = 1 it can be deduced from a result of Nachmias and 
Peres [12] that the longest path is of order n 1//3 . 

Here we fix a maximum A for the average weight and try to maximize the length for the longest 
path that satisfies this constraint. If we reverse the optimization (i.e., we insist on a path that visits 
every vertex and minimize the average weight), it becomes the classic traveling salesman problem 
in the mean-field setting. For this question, Wastlund [14] established the sharp asymptotics for 
more general distributions on the edge weight, confirming the Krauth-Mezard-Parisi conjecture 

ma ehe!. 

Finally, the maximal size T(n, A) of the subtree whose average weight is at most A was studied 
in PJ. It was shown that T(n, A) transitions from o(n) to Q(n) at some critical point Ao whose 
value can be specified in terms of a fixed point of a mapping on probability distributions. 

Main ideas of the proofs. We view the problem from a slightly different perspective. We first 
fix a length I and compute the minimal average weight of all paths of length l\ then we vary I to 
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match this minimal average weight with A. A simple and useful fact is that the minimal average 
weight is increasing with I at least in a coarse sense (c.f. Claim [23]) ■ 

With the aforementioned perspective in mind, our proof ideas can be traced back to Bramson's 
celebrated work [6], which gives a very precise evaluation on the minimal displacement of the 
branching Brownian motion. The main obstacle is that, we do not have a real tree structure in the 
mean-field setting (one could argue that it is locally tree-like, but certainly not globally), which is a 
crucial component in Bramson's argument. The lacking of a tree structure poses the challenges on 
how to control the correlation between different paths (say, of the same length) and how to select 
the truncation function for the second moment calculation, where the two issues are intrinsically 
related to each other. In what follows, we discuss the solution to these challenges focusing on the 
case of A = 1/e. 

The solution arises from the following observations. Note that there are two opposite forces 
on the (maximal) deviation of the partial sums from the expectation for a typical path with small 
average weight. First, the deviation cannot be too large since otherwise there would exist a path 
whose average weight is too small, but that is unlikely due to a first moment calculation (c.f., 
Lemma |2 . X [) . Second, the deviation cannot be too small for long paths since conditioning on the 
average weight of a path, the partial sums behave like a Brownian bridge and thus typically exhibits 
a deviation of order -y/path length (c.f., Lemma l2.3p . These two forces together, leaves no other 
possibility but that the path of small average weight is short. 

Crucially, the aforementioned deviation of the path serves well as the truncation function (for 
the proof of the lower bound). Observe that a bad event which produces large probability for the 
average weights of both two paths to be small, is that the weights on the common edges for these two 
paths are unusually small. However, once restricted to paths of small deviation, the total weight 
of the common edges cannot differ much from the expectation (given the average weight of the 
path) and it is indeed bounded by the maximal deviation multiplied with the number of segments 
induced by these common edges (c.f., Definition I2.8() . Another important ingredient is that the 
number of pairs (of the paths) decreases rapidly with the number of segments of the common edges 
(c.f., Lemma l2.10p . Altogether, this allows us to control the correlation globally, and thus provides 
a way to prove the lower bound. 

Discussions and further questions. Our work suggests a number of open questions. Naturally, 
one could ask what is the location and behavior for the second phase transition. The main obstacle 
for identifying the transition location seems to be that we have to select different truncation func- 
tions for the upper and lower bounds in the proof. More importantly, the probability costs for these 
two truncations are hugely different. The argument of Bramson also adopts different truncations 
(the so-called upper and lower envelops), but the probability costs for these two turn out to be of 
the same order in that case. 

It would also be interesting to determine the right order of L(n, A) in the regime of Theorem ll.2i 
The lower bound we obtained there seems to be far away from being tight. The main limitation 
of our arguments is that, we rely heavily on the fact that the number of pairs (of paths) decreases 
rapidly with the number of segments for the common edges assuming a fixed number of common 
edges. This stops being true once the path under consideration gets too long. 

An alternative direction is on the refined estimate at criticality. In particular, does 



in probability for some £ > 0? If so, what is the limit and what is the variance of L(n, 1/e)? 




(2) 
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A word on notation. Throughout the paper, we denote by C, c > absolute constants whose 
value could vary from line to line. Other absolute constants like a,/3,C*,c* are fixed once for all. 
As we have different regimes to consider, we usually fix the value of the parameter A and possibly 
other parameters in each section/subsection, and all of these settings for values will appear at the 
very beginning at each section/subsection. 

2 Critical behavior within scaling window 

In this section, we study the critical behavior within scaling window and prove Theorem 11.11 



2.1 Deviation of typical light path 

For a path 7 = vo, e\, v\, . . . ,e£,V£ where Vi-i and V{ are endpoints of e$ for all i E [£] (of course 
a sequence of edges would already uniquely specify a path, but we purposely choose to emphasize 
both vertices and edges for a path in this work), let 

*(7) = YLlX* (3) 

be the (total) weight of 7. Clearly, X(-f) follows Gamma distribution, which is of central importance 
throughout the work. Let Z ~ T(0,k) be a Gamma variable with parameter (6,k), that is to say, 
Z has the same law as a sum of k i.i.d. exponential variables with mean 6. We will repeatedly use 
the density function fe t k(z) of Z, where 

fe,h{z) = zk ~ l e uk'l 1){ ail O 0,0 > 0,* € N. (4) 

We first show that the average weight of a path cannot be significantly smaller than 1/e. For 
convenience of notation, denote by the collection of all paths of length £, for any £ E [n]. 

Lemma 2.1. Let E be the event that there is a path of length £ with weight at most £/e — logn by 

E = U? =1 U 7er , {X(rf) ^ e~H - log n} . (5) 

Then P(E) -)• 0, as n -)■ 00. 

Proof. For any £ E [n], we have ^ n e+1 . In addition, by the probability for each 7 E Ti 
has total weight less than e -1 ^ — logn is bounded by 

— Ijn 

P (X( 7 ) < e^£ - logn) < 10 • (e" 1 ^ - logn)^ 1 * = 0( n -^) , 

nr (1—1)1 

where the last transition follows from Stirling's formula. An application of a union bound over 
and then over £ E [n] yields the lemma. □ 

Define M(j) to be the deviation of 7 away from the linear interpolation between the starting 
and ending edges, by 

M( 7 ) = sup 1<fc<< |£f=i*i - |X( 7 )| . (6) 

By Donsker's theorem, it is not hard to say that the deviation process {5Zi=i X — ^^(7)} con- 
ditioned on the value of ^(7) converges to a Brownian bridge after suitable normalization. The 
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deviation of a Brownian bridge, i.e., the maximum of the absolute values, is known to have Kol- 
mogorov distribution, where the law can be written down explicitly as a sum of series. In particular, 
its left tail area has been obtained in 1 13 1 as follows: 



P(max oa , 1 |^| < 5) = ^ + °* (1) e-^ , (7) 

d 

where (-Bt)o<t^i is a standard Brownian bridge, and ^ 0,5(1) J, as 5 — > 0. The analog to 
deviation of Brownian bridge gives convincing evidence for the type of decay for the lower tail 
of M(7). However, as we are trying to analyze the tiny probability for a rare event, the desired 
estimate could not follow directly by convergence in law. We give a proof in what follows, without 
aiming at optimizing the exponents for the decay. We start with the next simple claim. 

Claim 2.2. For i.i.d. exponential variables Zi with mean 9 > and m ^ n/2, let Z = ^27=1^1 
and Z' = YliLi ^i- Let g(-) be the density function of Z' conditioned on Z = On. Then for all 
I4.\z-6m\^ lOVOm, 

1. /a m N 3 > / 



lO 6 V0m ay ' \fdrn, ' 

Proof. Let fk,e(~) be density function of Gamma distribution as in By Bayesian formula, we 
obtain that 

/ x fm,0{ z )fn-m,0{0n-z) _ (0„ - z ^n-m-l e -(6n-z)/e 9 n {n-l)\ 

9\ z ) — 



fn,e{° n ) 6 m {m-l)\ e n - m {n-m-l)\ (0n) n - l &- n 

Now the claim follows from a direct computation with an application of Stirling's formula. □ 

Lemma 2.3. Let Z{ be i.i.d. exponential variables with mean 6 > for 1 ^ i ^ n. For 1/4^/9^4, 
consider the variable 

M = M n = sup 1<fe<n |^ =1 Zj - pk\. (8) 
Then, there exist absolute constants c, C* > such that for all r ^ 1 and n r 2 , 



e -c*n/r* ^ p(M ^ r J Y™ =1 Zi = pn) sC e 



■c*n/r 2 



Proof. First observe a useful property for exponential variables: for i.i.d. exponential variables Y% 
with mean 9\ and i.i.d. exponential variables Z{ with mean 62 for any #i,#2 > 0, we have that for 
all k £ N and z > 

(Yi,Y 2 , ...,Y k \ YH =l Yi = z) ^ (Z U Z 2 , ...,Z k I YH=xZi = z) , (9) 

since both vectors are uniformly distributed over {(z\, . . . , z k ) E : Y2i z i = z } conditioning on 
the sums being z. Therefore, we can in what follows assume that 9 = p. 

We now give a proof for the upper bound. For convenience, we assume that r is a positive 
integer. The intuition (also for the lower bound) is that we can divide n into blocks of size r 2 , 
and in every such a block the fluctuation of the path is of order r and thus the probability for 
the path in this block to stay within [—r,r] is bounded away from and 1. Since the number of 
blocks is n/r 2 , this gives the right type of decay for the lower tail of fluctuation. Precisely, for 
j = 1, . . . , [n/r 2 \ — 1, define the event Qj by 

Qj = ^(j-l)r^k^jM\T,i=l Z i ~ P k \ < r ) ■ 
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It is clear that 

HQj+l | Qi,...,Qj) ^ P (lEo-i)r^Kir^i " H < 2r | Qi,... Qj) ^ 1 - 10- 7 , 
where the last step follows from Claim [2T2l This yields that 

P (M ^ r I £" =1 ^j = /m) < P (Qj : VI < j < [n/r 2 \ - l) < (1 - 10^ 7 ) L?a/r ' 2j _1 . 
Now we turn to the proof of lower bound. For j = 1, 2, . . . , [fi/V 2 ] > define the event 

Rj = - Pjr 2 \ < r/2}, and 5, = n {j _ 1)r2 ^ >2 {|£t (i _ 1)r2 Z 4 - - (j - l)r 2 )| < r/2}. 

It is clear from triangle inequality that 

ni<j< ri / r 2_ 1 i^- n n 1<j<n/r 2Sj c {m < r} . (10) 

Furthermore, by Claim [2T21 again, we get that (write 1Z = rii^^ n / r 2_ 1 i?j) 

PW > n W n/r»-lP(^ I ^1. ■ • ■ . Rj-l) > !0~ 8n/r2 • (11) 

In addition, conditioned on YjiL(j-i)r 2 ^ i = s ' we see ^at ^^^i=(j-iyX ^i) f° r 1 ^ ^ ^ 1 
converges to a standard Brownian bridge, and thus (see ([7])) for r ^ ro where ro is a large absolute 
constant, we have that P(£j | 7?., X^=(j_i) r 2^i = s ) ^ 10~ 2 • Trivially, for r ^ ro we have P(5j | 
^'Ei-fj-ijr 2 ^ = s) ^ 10~ r °. Since given the sum in each block, the variables in different blocks 

are independent, we can then deduce that P(nj£fSj | K) > I0~ r ° n / r2 , Combined with JTD]) and 
(fTTI) . this gives the desired lower bound with C* = 3tq. □ 



2.2 Upper bound 

In this subsection, we prove the upper bound for Theorem I l.li Recall that c* > is the absolute 
constant defined in Lemma 12.31 Set 

« = (12) 

Fix in this subsection 

A = X a = e _1 + a(log n)~ 2 . 

By monotonicity, it suffices to give an upper bound on L(n,A). We start with a simple claim, 
reducing the consideration to paths of length between [£, 2£) for the purpose of showing L(n, A) ^ £. 

Claim 2.4. The following holds deterministically. Suppose that there exists a path of length L ^ £ 
such that the average weight is £ for some £ > 0. Then there exists a path of length between [£, 2£) 
such that the average weight is at most £. 

Proof. Suppose 7 is the path of length at least £ and of average weight We can divide 7 into 
a collection of edge-disjoint paths where one of the paths is of length between [£, 2£) and all the 
others are of length precisely £. It is obviously that at least one of the divided paths must have 
average weight at most (, as required. □ 
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We now show that there cannot exist a long path with small average weight but even moderately 
large deviation. 

Lemma 2.5. For £ = (logn) 3 /a, we have 

F(3£ < £' < 2£,-f G I> : X(i) < A/,M( 7 ) ^ 3 logn) ->0. 

Proof. Suppose there exists £ ^ £' < 2£ and 7 G IV such that X(^) $C A£' and M( 7 ) ^ 3 logn. 
Denote by 7 = vq , ex , v\ , . . . , egi , vgi , and let £* be such that 

M(7) = l££i*<-£*(7)|. 

Let paths 71 = Vq, e%, . . . ,eg*,vg* and 72 = vg», eg* + \, . . . , eg/,vgr. By our assumption on 7 and 
definition of £*, we have 

either X( 7l ) < e _1 r - logn or X( 72 ) < e" 1 ^' - f ) - logn . 

This implies that 

{3 7 G I> : X(i) ^ X£',M(j) ^ 31ogn} C 
where E is the event defined in ([5]). The desired estimate now follows from Lemma |2. 11 □ 

We next turn to control paths with small deviation. 
Lemma 2.6. For £ = (log nf /a, we have 

P(3£ < £' < 2£,j G : X( 7 ) ^ A£,M( 7 ) sC 3 logn) 
Proof. Fix an £' with £ ^ £' < 2£ and fix 7 G I>. By (gj) and Lemma EH we obtain that 

P(X( 7 ) < A/,M( 7 ) < 31ogn) = o(i) n -^'/-V2 e 2eiogn e - c Mogr l /(9a) = o(l)rT 1 ' n~ e , 

where the last transition follows from the definition of a in (|12p . Noting that |r»| ^ n^' +1 , we deduce 
the desired result by first applying a union bound over 7 G Ygi and then over £ ^ £' < 2£. □ 

The upper bound for Theorem 1 1.1 1 is an immediate consequence of Lemmas I2.5l and l2. 61 together 
with Claim E2D 

2.3 Lower bound 

In this subsection, we prove the lower bound for Theorem 11.11 Fix A = e _1 — (logn) -2 in this 
subsection, and it suffices to establish the lower bound on L(n,A). Let c, 5 > be two small 
absolute constants to be selected. For 1 ^ £ ^ n and 7 G Tg, define 

F 1 = { \£ - 1 sC X(i) ^ X£, Af (7) 5 log n} . (13) 

By (Jl|) and Lemma 12.31 we obtain that for all 7 G Ti with £ = c(logn) 3 

P(F 7 ) ^ ^ n - l £- l / 2 n- ec - cC */ &2 , (14) 

where C* is the absolute constant from Lemma 12.31 Defining 

N = E 7 er, 1 i ;, 7 ' 

we see that the first moment of N would be large if we select j3, ( properly. The key issue here is 
to bound the second moment of N. 
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Lemma 2.7. Consider £ = c(logra) 3 . For any 7 S Ti, we have that 

Eyer/C^ n F i') < P ( F 7) ■ ( EAr + 0(1)*W) . 

In order to prove the preceding lemma, one needs to study the correlation structure between 
7 and all other paths in T^. In order to have a global control of the correlation between 7 and all 
other paths, a natural strategy is to first select different scales of correlations and then estimate 
the cardinality of the paths that fall into each scale. This strategy was implemented in the case of 
branching Brownian motion where the scale for correlation is chosen to be the number of common 
edges between a path and the path 7. In the mean-field setting, we do not really have a tree 
structure (as two paths can bifurcate and merge and then bifurcate...). In addition, merely the 
number of common edges does not seem to fully characterize the correlation between two paths. 
Therefore, we need to choose an auxiliary quantity which together with the number of common 
edges, offers an effective measurement of the correlation. We elaborate in what follows. 

For a path 7, denote by £(7) the collection of edges in 7. For S C £(7), we call a segment of 
7 an S-component if it is a maximal segment of 7 where all the edges belong to S. 

Definition 2.8. For two paths 7 and 7', we define a functional 0(7, 7') to be the number of S- 
components 0/7 where S = £'(7) n E(~/). 

The functional #(7,7') turns out to be a good additional measurement for the correlation 
between 7 and 7'. Given a collection of edges S, denote by V(S) the collection of vertices which 
are endpoints for edges in S. The next simple observation is of crucial importance for our proof. 

Lemma 2.9. For 1 < £ < n and 7, 7' G T e , Write S = E(j) n E(j'). We have 

\V(S)\ = \S\+9( 1 , 1 '). 

Proof. By definition, there exist no edge in S that crosses different S-components of 7, and there 
exists no vertex in V(S) that is belonging to different S'-components. Therefore, we can analyze each 
S-component separately. In addition, it is obvious that for each such S-component, the number of 
vertices is larger than the number of edges by 1. Summing over all the S'-components, we complete 
the proof of the lemma. □ 

Given a path 7 £ T#, we now partition based on its correlation with 7, i.e., based on the 
tuple (#(7,7'), 1-2/(7) H £'(7') |). Precisely, for all integers i ^ j, define 

Aij = Aij( 7 ) = {7' e : 0(7,7') = i,E( 7 ) n E{i) = j} . (15) 

It is now natural to control the cardinality of Aij. One could prove more precise estimate on Aij, 
but for our purpose, the following is sufficient. 

Lemma 2.10. For any 1 ^ £ ^ n and any 7 6 Ti, we have that for any non-negative integers i ^ j 

Proof. In order to bound |^4«j|, we consider the following procedure to generate a path in Aij: 

1. Select i vertex disjoint segments from 7 such that the total number of edges is j (and thus 
the total number of vertices is i + j by Lemma I2.9p . 
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2. Select £ + 1 — i — j vertices from remaining n — i — j vertices in the graph. 

3. Choose a direction for each of the segment (two options for every segment). Then take each of 
the i segments as an individual element and permute these i elements and £ + 1 — i — j vertices 
selected from step 2, such that no more edges in 7 will be introduced when the permutation 
is viewed path. 

Clearly, the cardinality of Ai a is bounded by the product of the number of choices Nf. (for k = 1, 2, 3) 
in each step. For Step 1, we see that the choice for the edges is complete determined by the 2i 
endpoints selected from the path 7 and vice versa, and thus N x ( £ + 1 ). For Step 2, we have 

N 2 = {(Xi^i-j)- For Step 3 ' li is obvious that N 3 ^ ^(i + I - Taking a product for iVi, N 2 
and N$, we complete the proof of the lemma. □ 

We are now ready to give 

Proof of Lemma \2.7\ For i = c(logn) 3 and 7 G Ti, we consider Ti as a union over A^j. For i = 0, 
the only legitimate choice of j is also 0, and in this case E(^')C\E(^) = for all 7' G A 00 . Therefore, 
the events F 7 and F{^') are independent. Thus, 

£y e Ao,o F ( F 7 n F y ) = Ey e A ,o F ( i? 7) " P(*V) < n^)EN . (16) 

Since \E(-y) n ^(7')! ^ ^(7)7') always, we next consider 1 ^ i ^ j ^ £. Take 7' G -Ajj, and we 
wish to bound the probability for the event Fy conditioned on F 7 . Write S = £^7) n E{^') and 
5' = E(pf) \ S. Conditioned on F 7 (see the definition of F 7 in (fT3"j) ). we have that 

£ ee5 X e ^ |5|A-l-2i<yiogn. 

Therefore, we obtain that 

P(Fy I F y ) < P (Eees'^e < A|5'| + 1 + 2Wlogn | F y ) =P (Eees'^e < A|5'| + 1 + 2i,51ogn) , 

(17) 

where in the last transition we used independence of these exponential variables X e . Recalling that 
\S'\ = t- j, we get from (j3J) that 

P(Fy I F 7 ) < 0(l)rT^V i5 . 
By Lemma l2.10i we see that A{j ^ (I + l) 3 *n f+1_J_ - J . A simple union bound then yields that 

Summing over 1 ^ z ^ j ^ £, we then obtain that 

Ei^KiEy^,/^ I ^7) < 0(l)£ 5 n 25 . 
Combined with (|16p , it completes the proof of the lemma. □ 
We next conclude this subsection with the proof for the lower bound on L(n, A). 
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Proof of Theorem ] 1. 1\ lower bound. Set 5 = A min(l, 1/(7*) and c = 5 3 . Since \Tg\ = (1 + 
o(l))n i+1 , the estimate (|14|) then gives that for sufficiently enough n 

EN ^ n 4 / 5 . 

Meanwhile, by Lemma \'2.7\ we have that 

EN 2 < EN(EN + n 1 / 5 ) = (1 + o(l))(EiV) 2 . 

At this point, a simple application of Chebyshev's inequality gives that P(iV > 0) — > 1 as n — > oo, 
completing the proof for the lower bound. □ 



3 The existence of second transition point 

Throughout this section, we let A = e _1 + e and assume that e ^ /3(logra)~ 2 for an absolute large 
constant (3 ^ 10 to be specified later. The goal of this subsection is to demonstrate the existence 
of another phase transition at a point in [e _1 + a(log n)~ 2 , e _1 + /3(log n)~ 2 ]. To this end, we prove 
Theorem 11.21 in this section, whose main content is a polynomial lower bound on L(n, A) when 
A-e" 1 ^ /3(logn)" 2 . 

The upper bound for Theorem 11.21 follows from a straightforward first moment computation, as 
quickly incorporated in what follows. 

Proof of Theorem ] 1.2c upper bound. Consider I ^ 6en. For any 7 S Tg, we have from @ that 
P(X( 7 ) < U) = 0(l)((e- 1 + e)e) i - 1 T7 ^ ZT¥ = 0(1/V^)e^ . 

In addition, we get that |r^| = n!=o( n ~~ ^ n e+1 e~^. A simple union bound over and 
Gen ^ £ ^ 12en then gives that 

P (36en ^ £ ^ 12en, 7 e T £ : Jf (7) ^ A£) . 

Combined with Claim this gives the upper bound. □ 



3.1 Deviation of typical light path: revisited 

In view of the lower tail of the deviation as in Lemma 12.31 it is obvious that when the length of 
the path gets large, the tail gets extremely small and thus needs to be tracked down carefully. In 
particular, we need an estimate for the lower tail of the deviation given the values of some of the 
variables along the path. We handle this delicate issue in this subsection. 

Lemma 3.1. Let Zi be i.i.d. exponential variables for i G N. Let 1/4 ^ p ^ 1 and M n be defined 
as in §S$) . Write for all s G N 

p s =P (M s ^ r I ZLiZi = H ■ (18) 

Then for j, k £ N, we have 

P > +k > W *rVTAk P > Pk ■ 
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Proof. Assume that j ^ k. The proof follows from a natural idea: conditioning on the partial sum 
of the first j variables. Given that this partial sum is close to the expectation within a window of 
size 1 (which occurs with probability 1 / y/j A k as j A k is the variance for this partial sum), the 
two segments are independent and the probability for each of them to have deviation smaller than 
r is very close to pj and p k . Noting that the probability for the whole sequence to have deviation 
smaller than r is larger than the product of the three aforementioned probabilities, we can then 
complete the argument. In what follows, we carry out the technical details. 

Denote by Q s = {(zi) : J2i=i z i = p(j + k ), E;=i z i = PJ + S P}> and by ft = U ^i/2^s- By 
Claim [2T2l we see that 

n(^)efl)^. 

Just for the technical reason (which will be clear later), we partition £Iq into a union of sets £Iq t 
such that (zi) G S7o,t if and only if r = min{i ^ j + 1 : Z{ ^ 5p}. Let 3 G MP +k be such that for all 
( Zi ) G 3 

IEi=i^ - ps| < r for all 1 ^ s < i + A;. (19) 
Since sequences in 3 has small deviation, we see that E C U^LifloT- Choose r* such that 

P((Zj) £Hn!l , T .) = max P((Z) eSn(l , T ) ^ ^P(Z, esn(] ). 
Next, we show that for all ^ 5 ^ 1/2, we have 

p((Zi) g s I (^) g n*) ^ ^p ((z) 6 En n 0)T * | (z») g n ) . (20) 

For (zi) G ^o,r* H 3, we map (z{) to ^ by let (z^) G be such that z[ = Z{ for i 7^ j, r* and 
^- = Zj + 5/9 and z' T * = z T * — 5p (the assumption that z T * ^ bp guarantees that Zj + i 0). Since 
^ 5 ^ 1/2 ^ r, it is clear that the sequence (2^) also satisfies (fl~9j) . Also, we see that the 
determinant of the Jacobian matrix of this mapping is 1. It remains to compare the densities for 
(Zj) at (zi) and (z^) given (Zj) G fio and (Zj) G ft<5, respectively. It is obvious and straightforward 
to check that the ratio of these two densities are within a constant factor, say, 10. This yields that 

Pj+k > ^P((z 4 ) g n)p ((Zj) g 3 n ft ,T* I n ) > w ±- /1 p j p k , 

where the last transition we used conditional independence given Qq. Altogether, this completes 
the proof of the lemma. □ 

Lemma 3.2. Let Zi be i.i.d. exponential variables for i G N. Consider 1 ^ r ^ y/n and 1 ^ a\ ^ 
b\ ^ 02 ^ • • • ^ o-m ^ b m ^ n suc/i i/iai a = — cij + 1) ^ n — lOr. Lei 1/4 ^ p ^ 1 and M n 

be defined as in . T/ien for all Zj such that 

we /iaue (write A = U^LJaj, 6j] n N and use t/ie notation of p s as in (I18p j 

P (M n ^ r I ^ =1 Zi = pn, Zj = z-ij G A) < 0(r y/q An — q)p n 10 100mr e C * q ^ r2 , 
where C* is the absolute constant from Lemma \2.3[ 
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Proof. We first sketch the outline of the proof. Since we are conditioned on the sum of Zi, by ([9]) the 
mean of Zi is irrelevant. Just for convenience, some times we assume that Zi has mean p. We will 
consider a new sequence of i.i.d. exponential variables (Z-), whose average will also be conditioned 
to be p. In addition, the size n' of the new sequence is larger than the number of free variables 
(the variables that are not conditioned to be a given value) in the old sequence (Zi) by lOmr. For 
each segment [aj,6j], we force a segment of size lOr in (Z[) such that its partial sum (biased by 
p for each variable) grows almost linearly with end points being and X^e[a b ]( z j ~ P) ( sa y the 
probability cost is p). Given this linear interpolation, the free variables in (Zi) and (Z[) will have 
almost the same distribution and we couple them together. Furthermore, if the free variables are 
such that the deviation in (Zi) is less than r (say this occurs with probability p'), so should it be 
in (Z'A by our construction. But we know that the probability for (Z[) to have deviation smaller 
than r is p n >. Therefore, we can deduce the bound p n > ^ p • p'. The technical details are carried 
out in what follows. 

Let (Z'^i^i^ni be i.i.d. exponential variables where n! = n — q + lOmr. We first define the 
mapping (fi(-) between the coordinates of the original sequence (Zi) and our new sequence (Z'A, by 

cj)(t) = \i <; t : i G" A\ + Wr\i :bi^t\. 

Note that <j)(t) remains constant over [aj, bi). The intuition behind is that we replace each segment 
[ai, bi] in the original sequence by a segment of size lOr. Write Sj = Y^j= ai z j ~ Pfti — a« + 1), for 
1 ^ i ^ m. By definition, we have |sj| ^ 2r. For ^ Si ^ 1/20, we define ^tsA Q I& n such that 
(xj)i^j^n' £ Q>5i if for all 1 ^ i ^ m 

VI < k < lOr : - 1/20 < £j =1 a^ (6i) _ 10r+i " Kp + «</10r) < 0, 
Y^X x <t>{bi)-lQr+j = Wr P + Si- Si. 

The idea, as we outlined before, is that we use the added segment to linearly interpolate the (biased) 
endpoints of the old segment while we keep the total sum (biased by mean) to be very close. Denote 
by = U^s i )^[o^/20] m ^(8 i )- A repeated application of Claim E2] gives that 

P((^-)i W ' e n) > io- 80mr . 

Next, consider E C M. n such that (xj) G S if, 

Vjf G A : xj = Zj, and V/c G [n] : \ Yl^=i x k ~ pk\ ^ r, and Y^=i x j = P n ■ 
Just as a translation of our definition of H, we have 

P [M n < r | Ei=i^j = ^ = ^Vj G A) = P ((^)i^Kn 6 S I Er=i^i = Z™' ^ = ^ e A ) ■ 

(21) 

Now for each (Si) and (^jOi^^n' £ ^(a^, we construct a mapping ip(Si),(ujj) '■ ^ ^ ^(^) such that it 
maps (zjOi^n G E to (yj)i^j^n' 

Mj £ A: yj = ujj ,Mj G" AU {h, . . . , b m } : y <t>j = xj, and y<f> b .+i = x bi+1 + Si . 

Crucially, the density of (Zj)\^j^ n at (xj)i^j<^n given that {Zj = Zjij G A} and Ylje[n] ^3 = P n ^ 
is comparable with the density of (^)i<j<n' a * an Y (Uj)i^j^ri given that (2*-)i$j< n ' 6 ^(6*) an d 
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Uj = ojjij G A and Y^j=i z 'j = P n '■ Indeed, the ratio between these two densities can be directed 
computed and founded to be within a factor of 10 m . In order to see this, note that given these 
conditions, the two random vectors have the same number of free variables and the sum of these 
free variables differ by amount of order m. 
Define H' C W 1 ' such that if (y,) G 5', 

(yj) Gfi,VfcG [n] : 1X^=1%' ~ M < r > and YJj=iVi = P n ' ■ 
By definition, we can verify that for every (uy) G ^(5;) 

^ w (E)e3'far all (^€[0,1/20]™. 
We can then finally conclude that 

P((^)l^GE'|Eili^ = Pn') 

> P (^) g n) minP((Z^)i^j^ n / G E' | Y^ij=l Z j ~ P n ' -> i Z j)l^j^n' G ^(<$ 4 ) 

> 10 -8rm 10 -mp ((^-) 1<Krl G E | Z,- = ZjVj A, E"=l^n = pn) . 

Combined with f)21 1) . it follows that 

P (M n < r | YZ=l Z i = f™> Z = z ^ G 10 (8r+1) >n' • 

Now the desired estimates follows from Lemma 13. II and Lemma 12.31 □ 

3.2 Lower bound 

Throughout this subsection, fix A = e _1 + /3(logn)~ 2 for a large absolute constant f3 > to be 
specified. Write I = n 1//4 . Let ( > be a small constant to be specified later. For 7 G 1^, define 

G 7 = { A£ - 1 < X( 7 ) < A£, M( 7 ) < C log n • X( 7 ) /A^} . (22) 

Remark. Note that given that the event G( 7 ) occurs, we always have M( 7 ) ^ Clog re + 1- The 
extra seemingly funny factor of X( 7 )/A-£ is not crucial in the definition of GL. It is merely for the 
purpose to have the following: (which will save us some tedious effort) 

P(M( 7 ) < Clogn • X(i)/\e | X( 7 ) = z) = constant for all X£ - 1 < z < A^ . (23) 

Property (|23[) follows from the fact that for all z > 0: 

{i(X e ) ee7 | X( 7 ) = Z } /a =" {(X e ) ee7 | Z( 7 ) = 1} , 

which one can verify by definition of exponential variables and (J1J) . 
By and Lemma |2.3| we obtain that 

P(G 7 ) > ^n-'t ^ 2 exp ((e/3 - C*/C 2 )£Qog n)~ 2 ) , (24) 
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where C* is the absolute constant from Lemma [2.31 Defining N = X^eiy 1g t > we see that the first 
moment of N would be large if we select c, 5 properly. In particular, we have 

EN = P(G 7 )|r,| = (1 + o(l))P(G 7 )r/ +1 ^ fii^r^exp ((e/3 - G7C 2 K(logn)- 2 ) . (25) 

As in Section 12, 3[ the key issue is to bound the second moment of N. We use the basic ideas 
in Section [2. 3 \ with more delicate analysis. One of the main difficulties is that now the probability 
cost for the truncation on the deviation is so large such that it has to be tracked down throughout, 
requiring delicate estimates on the deviations of light paths (as incorporated in Section I3.1|) as well 
as a careful treatment when patching estimates together. 

Lemma 3.3. For any 7 G and 7 ' G A^j with 1 ^ i ^ j, we have that 

P(Gy I Gy) ^ P(G 7 )0(y/£/(£ - j))n% 300 ^e( c */ ?2 - e/3 ) J '( logn )" 2 . 

Proof. Write S = E{pf) n E(^') and S' = E(j') \ S. Conditioned on G 7 (see the definition of G 7 in 
([2"2j) ). we have that 

Eees x e >X\S\-1- 2i( log n = Xj-l- 2i( log n . 
Therefore, we obtain that 

P(Gy I G 7 ) = P (X£ - 1 ^ X(V ) ^ XI I G 7 ) -P (M (Y ) < C log n ■ Xtf) /XI | G 7 , Xi - 1 < X( 7 ') < M) . 
It is clear that 

P(A£-1 < X( 7 ') sC A^ I G 7 ) < P(EeeS' X e < A|5'| + 1 + 2iClogn | G 7 ) 

= P (Eees'^e < A|5'| + 1 + 2iClogn) . 

Recalling @ and that \S'\ = i — j, we get that 

P (X£ - 1 < X{i) ^ Xi I G 7 ) ^ 0((i - 7)- 1 /2) n -(^i) e ^(^')(iogn)- 2 n 2Ci _ 
By Lemma I3T21 and property (f23j) . we obtain that 

P(M( 7 ') < Clogn-X( 7 ')/A^ I G^iX- 1 < X( 7 ')) 

s£ P (M( 7 ) < Clogn • X(j)/X£ \X£-1^ X( 7 ) < Ai) a/j A (n — j) n 300Ci e c*j(Ciogn)-2 _ 

Note that 

P(G 7 ) = P (XI - 1 X( 7 ) < Xi) P (M( 7 ) < C log n ■ X(j)/Xi \ Xi - 1 ^ X( 7 ) ^ XI) , 
P(Gy I G 7 ) =P(Ai-l < X(i) < A^ I G 7 )P(M(Y) < Clogn -X^')/X£ \G 7 ,X£-1^ Xtf) < Ai) . 

Combining the last four displays together, we complete the proof of the lemma. □ 
We next conclude this subsection with the proof for the lower bound on L(n, A). 
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Proof of Theorem lower bound. Set 

C = 1/10 4 , and $ = 10 8 max(C*, 1) . (26) 

By (l25l) . we see that 

EN > n 3/4 . 

Next, we turn to bound the second moment of N. For 7 G 1^, consider Yi as a union of Ai j over 
^ i ^ j ^ £. For all 7' G A),o> the events G 7 and ^(7') are independent. Thus, 

Sy e A ,o P ( G '7 n Gy) = Ey 6 a,.o P ( G 7) • P ( G 7') < P(G 7 )EiV . (27) 
We next consider 1 ^ i ^ j ^ I. For 7' G -Ai?, Lemma 13.31 and (I26p gives that 



P(Gy I G 7 ) ^ P(G 7 )0(v^ - J) n i^/10 e -10i(logn) _ 
Combined with Lemma 12.101 it follows that 

£y eAi .P(Gy I G 7 ) < P(G 7 )^ +1 n- i/8 e- 8i{logn) " 2 = EN ■ n - 4 /V 8j(logn) ~ 2 . 
Summing over 1 ^ i ^ j ^ £ and recalling (|27l) . we obtain that 

E(iV I G 7 ) < (1 + O(n- 1 / 10 ))EiV , 

and therefore 

EiV 2 = E 7er /(G 7 )E(iV | G 7 ) = (1 + O(n- 1 / 10 ))(EiV) 2 . 

At this point, a simple application of Chebyshev's inequality gives that F(N > 0) — > 1 as n — > 00, 
completing the proof for the lower bound. □ 

4 Smooth interpolation through near sub-critical regime 

In this section, we demonstrate a smooth interpolation from sub-criticality to criticality by proving 
Theorem 11.31 The proof uses similar ideas as within the critical window and is also simpler. We 
write a separate proof in order to reduce distractions for the presentation of the core ideas in the 
critical regime. The proof in this section will be presented in a concise way. Throughout, we let 
A = e -1 - e ^ e" 1 - (logn)" 2 . 

To prove the upper bound, we see that for £ = e^ 1 logn (by Claim [2~i|) . 

P(L(n, A) ^ i) P (31 l' < 2£, 7 G I> : X( 7 ) ^ \t') ^ E^'<2^' +ln ^ ' ^ = • 
For the lower bound, consider £ = 10 4^"i)£ ' anc ^ ^ or ^ e ^ define 
H J = {\£-1^ X( 7 ) ^ \£, M( 7 ) sC log n/10} . 
Then, by and Lemma 12.31 we have that 

EN > ^+i_i_ n -^ e -^-iooc^/(iogn)2 ^ 
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It remains to control the second moment of N. Analogous to derivation of (|17p . we obtain that for 

Thus, by Lemma 12.101 we obtain that for all 1 ^ i ^ j 

EyeA^HHy | H,) = O^n^ . 
Summing over ^ i ^ j ^ i, we obtain that 

EN 2 = (EN) 2 + EN ■ l 5 nn- 4/5 = (1 + o(l))(EiV) 2 . 
The lower bound follows immediately. 
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